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• Premise of the study: Microsatellite primers were developed for the tree species Genipa americana (Rubiaceae) for further 
population genetic studies. 

• Methods and Results: We identified 144 clones containing 65 repeat motifs from a genomic library enriched for (CT) 8 and 
(GT) 8 motifs. Primer pairs were developed for 32 microsatellite loci and validated in 40 individuals of two natural G. ameri- 
cana populations. Seventeen loci were polymorphic, revealing from three to seven alleles per locus. The observed and expected 
heterozygosities ranged from 0.24 to 1.00 and from 0.22 to 0.78, respectively. 

• Conclusions: The 17 primers identified as polymorphic loci are suitable to study the genetic diversity and structure, mating 
system, and gene flow in G. americana. 
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Genipa americana L. (Rubiaceae) is widespread throughout 
Brazil and tropical America, both in plantations and in the wild 
(Durigan and Nogueira, 1990). It is a dioecious tree species 
(Mielke et al., 2003) and is used in forest gallery restoration 
(Salvador, 1986) and in mixed plantings of swampy and degraded 
permanent preservation areas, because of its high tolerance to 
flooding (Lorenzi, 2002). As a result of habitat fragmentation 
and damage caused by human activities, the remaining natural 
populations of G. americana currently occur in small forest 
fragments and protected parks. This represents a potentially se- 
rious long-term threat to global biodiversity. Thus, it is neces- 
sary to understand the effects of forest fragmentation on the 
genetic diversity and structure, mating system, and gene flow of 
the species for conservation and sustainable use. Therefore, as- 
sessment of genetic studies using molecular markers is urgently 
needed, particularly for the conservation of genetic resources of 
the remaining populations. For this purpose, the development 
of microsatellite markers for the species is useful, due their high 
level of polymorphism, and will create new opportunities for 
conservation research that can be used to minimize the negative 
implications of population fragmentation. In this study, we cre- 
ated a set of 17 polymorphic microsatellite loci for G. americana 
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that were confirmed to be reproducible in assessing its genetic 
diversity, mating system, and gene flow. 



METHODS AND RESULTS 

Fresh G. americana leaves were collected from a single individual to be 
used in the initial DNA extraction, using the protocol proposed by Doyle and 
Doyle (1990). A microsatellite-enriched genomic library was constructed fol- 
lowing the protocol by Billotte et al. (1999), with slight modifications. Ge- 
nomic DNA was digested with Rsal enzyme (Invitrogen, Carlsbad, California, 
USA) for 3 h incubation at 37°C, and the digested fragments were enriched in 
microsatellite fragments using (CT) 8 and (GT) 8 motifs. Digested fragments 
were then ligated to the double-stranded /foal adapters Rsa21 (5'-CTCTT- 
GCTTACGCGTGGACTA-3') and Rsa25 (5 '-T AGTCC ACGCGT A AGC A A- 
GAGCACA-3') for 2 h incubation at 20°C. Hybridized DNA was captured by 
streptavidin-coated magnetic probe beads (MagneSphere Magnetic Separation 
Products, Promega Corporation, Madison, Wisconsin, USA). The enriched 
fragments were amplified by PCR, the product was cloned into pGEM-T Easy 
Vector (Promega Corporation), and ligation products were used to transform 
Epicurian Coli XL 1 -Blue Escherichia co //-competent cells (Stratagene, Agi- 
lent Technologies, Santa Clara, California, USA). A total of 192 recombinant 
colonies were obtained, and 144 were selected and sequenced in an automated 
ABI 3500xL Genetic Analyzer (Perkin Elmer-Applied Biosystems, Foster 
City, California, USA) using T7 and SP6 primers and the BigDye Terminator 
version 3.1 Cycle Sequencing Kit (Perkin Elmer-Applied Biosystems); 65 of 
these contained microsatellite motifs. Dinucleotide motifs were the most abun- 
dant, followed by mono-, tetra-, tri-, and pentanucleotide motifs (approximately 
80.49%, 10.97%, 4.88%, 2.44%, and 1.22%, respectively). Oligonucleotides 
complementary to the genomic sequences flanking the microsatellite region 
were designed using Primer3Plus (Untergasser et al., 2007) according to the 
following criteria: size of primers preferably between 18 and 22 bp; melting 
temperature (r m ) between 45°C and 60°C; amplified product length between 
100 and 300 bp; and GC content between 40% and 60%. Developed primer 
pairs were assessed using 40 samples by PCR amplification on a Mastercyler 
thermocycler (Eppendorf, Hamburg, Germany) in a final volume of 16 jiL us- 
ing GoTaq Colorless Master Mix (Promega Corporation) containing 6.0 uL 
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Table 1. Characteristics of 32 microsatellite loci developed in Genipa americana: 



Locus 




Primer sequences (5'-3') 


Repeat motif 


Allele size (bp) 


r m (°Q 


GenBank accession no. 


GamOl 


F : 
R: 


CATTCCACATTTGCCCTTG 
GCTTTCCTGTTCCCTAAATCC 


(TC) 16 


175 


60 


KF031152 


Gam02 


F : 
R: 


GCACCAGAGTCTAAAGCCAGA 
TGCACGAGTTCATTGAGATTG 


(CT) 8 (TC) 16 


186 


60 


KF031153 


Gam03 


F : 
R: 


TGAAATTGCCTCTTACACACAC 
AACCACTTCCTTGAACACTGC 


(AC) 9 (AT) 5 


245 


59 


KF031154 


Gam06 


F : 
R: 


CTCTGCGTGTGTGTGTGTGT 
CAAAGACTGTGCGGCATCT 


(CA) 10 


150 


60 


KF031155 


Gam08 


F : 
R: 


ACGGCTTTTCTTGTCTGGTC 
GGCATAATCCAAACCTGCTA 


(CCT) 8 


175 


59 


KF031156 


Gam 10 


F : 
R: 


GGCTCAATGGGTGGCTTA 
TATAAGGGGTCAGAGGAAAACA 


(AC) 8 


150 


60 


KF611785 


Gam 11 


F : 
R: 


AGCCACTACCACCAGTCCAT 
GGAGACCGAGTGTTACATTTCA 


(TC) 5 (AC) 8 


215 


59 


KF031157 


Gam 12 


F : 
R: 


ACGTGTGTGTGTGTGTGTATTG 
TGCCCTCCTATATTTTTCCTCT 


(GT) 8 (GA) 5 


178 


58 


KF611786 


Gam 14 


F : 
R: 


AAGCGTGTTGACAATGCTG 
CTTCAAGCCAGTCACCTTCC 


(CT) 19 (CA) 10 


198 


60 


KF611787 


Gam 16 


F : 
R: 


CCGAGTCCTATTTAGTCGCATT 
CCGCTTTGCTTCTATCACAC 


(TG) 6 (TG) 6 


158 


60 


KF031158 


Gam20 


F : 
R: 


TGTGTGTATGTGTGGGTGTGT 
ACTTGAACCCCTCTCCAAAA 


(TG) 6 


234 


59 


KF031159 


Gam21 


F : 
R: 


TGGGTTGAGTAGGAAGAGAGC 
TTGTTCTCCTTTTGCTCGTG 


(AC) 9 


249 


59 


KF031160 


Gam23 


F : 
R: 


GGAAAAAGCCGAATGTTAGTC 
TGGGAGCATAAAAGCCTAGAA 


(GCAC) 4 (AC) 8 (GA) 29 


249 


59 


T7~ T~?/" 1 1 TOO 

KF61 1788 


Gam24 


F : 
R: 


T G T C C AAAAAC C AG AAAT C 
C AGC T GGAAT AAGAAAAT GA 


(GA) 29 


189 


53 


KF031161 


Gam25 


F : 
R: 


GAAGTTCAGTTAAGACGCATCA 
GAAAAC T GT GGAAGC AAT GG 


(AC) 8 


238 


58 


KF61 1789 


Gam26 


F : 
R: 


ACTACAAAGGCAGGTGGTCTC 
GTGTCGGGAATGAAAGAAACT 


(TC) 5 (CA) 8 




58 


KF61 1790 


Gam27 


F : 
R: 


CCAGCCGTAGTTGAAAAAGA 
ACAAGGCGGTGGAGTTCT 


/ A / A / ATX 

(AC) 6 (AC) 7 (AT) 6 


260 


58 


KF611791 


Gam28 


F : 
R: 


ACTCAGTCAACCTCCGAA 
TACCCAAAAGATTCAGCC 


(AC) 8 


226 


54 


KF031162 


Gam30 


F : 
R: 


TAATGGGGCCTTCAATAAGTT 
CAGGTATGCATTGGAAACACA 


(TG) 6 


241 


58 


t T' t 1 1 1 ^1 C\r\ 

KF6 11792 


Gam31 


F : 
R: 


TTTGGCCAATATTTACTGAT 
TTCAATGGTTCATCCTCC 


(AG) 8 


267 


54 


KF031163 


Gam32 


F : 
R: 


ATGAGGGAACGCTGCCGAAG 
GGTGGAGGAGGGGGAGGAAG 


(CCT) 8 


174 


67 


KF031164 


Gam33 


F : 
R: 


TGTTTCTTTCTCTGCCAT 
GAAAAAGACAACCATACCTT 


(AC) 5 


118 


51 


KF031165 


Gam34 


F : 
R: 


GATGATTGCCGATTAGTG 
TCATTAGTATCCATCCCTTAC 


/ A / A /~*\ / A A A A 

(AG) 9 (AG) 5 (AAAAC) 3 


213 


52 


KF6 11793 


Gam35 


F : 
R: 


T T C TAAC AAAT C C TAT C TAA 
GC T AC T C AC AAAAAT G AT 


(AC) 5 


214 


45 


KFo 11794 


Gam36 


F : 
R: 


TGACTTGGTGCTGTGAGACGAG 
TCAAAATCCTCCCCGCCTT 


(AC) 5 


214 


64 


KFOilloo 


Gam37 


F : 
R: 


GGACGCTAAAGAGCAAGAGG 
CACAGCAAGGATGGAGCA 


(TG) 8 


209 


59 


KF6 11795 


Gam38 


F : 
R: 


GCATAAATGACCTACACA 
GTTGCTACTTTATCCTCTT 


(GAAA) 3 


140 


47 


KF031167 


Gam39 


F: 
R: 


G AAC T C G AT GG AAAT C AG 
CAACTTAATGTGTCTGTAGGA 


(AQio 


132 


51 


KF611796 


Gam40 


F: 
R: 


TGTTCCGTTTGAAACCCTTG 
GAAGCCTAGAACGCATCAGACT 


(TG) 7 


279 


61 


KF6 11797 


Gam41 


F: 
R: 


TCTTGATTACTGGATGACAGAC 
C C T T C AC T GC T T C AC AC AC 


(AC) 10 


132 


55 


KF031168 


Gam42 


F: 
R: 


ATTTGGCTAGCATTTCTTCTCA 
GCATTTTCTTTAGTGGTCGTGT 


(TC) 7 (TG) V (GA) 8 (GA) 5 


185 


58 


KF611798 


Gam43 


F: 
R: 


AGCACCTTACCCCTTTCATA 
GCACTTCATTTTCTTTCATTTC 


(AC) V (AC) 6 (AC) 6 


269 


56 


KF6 11799 



Afote: T m = melting temperature. 

a All values are based on two populations located in Selvma, Mato Grosso do Sul (22°22 , 02 ,, S, 51°25'08"W), and Mogi-Guagu, Sao Paulo (22°17'25"S, 
47°10'55"W), Brazil. 
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Table 2. Results of initial polymorphic micro satellite marker screening in two populations of Genipa americana.* 







Selvfria (N 


= 19) 






Mo 2 


i Guacu (N= 21) 




Locus 


A 


H 0 


He 


F 


A 


H 0 


H e 


F 


GamOl 


3 


0.866 


0.590 


-0.468* 


6 


1.000 


0.783 


-0.277* 


Gam02 


5 


0.789 


0.624 


-0.265* 


5 


0.889 


0.662 


-0.343* 


Gam03 


4 


0.579 


0.652 


0.112* 


5 


0.471 


0.561 


0.161* 


Gam06 


8 


0.750 


0.750 


0 


6 


0.714 


0.728 


0.019* 


Gam08 


4 


0.579 


0.623 


0.070* 


4 


0.400 


0.459 


0.129* 


Gam 11 


4 


0.789 


0.564 


r\ r\ r\f\ ^ 

-0.399* 


7 


0.951 


0.678 


r\ a r\r\ -a. 

-0.402* 


Gam 16 


5 


0.895 


0.642 


-0.394* 


3 


0.842 


0.548 


-0.536* 


Gam20 


5 


0.883 


0.688 


-0.283* 


4 


0.867 


0.676 


-0.282* 


Gam21 


4 


0.555 


0.668 


0.169* 


3 


0.368 


0.468 


0.213* 


Gam24 


8 


0.631 


0.700 


0.098* 


5 


0.476 


0.670 


0.290* 


Gam28 


4 


0.477 


0.560 


0.149* 


4 


0.300 


0.612 


0.510* 


Gam31 


3 


0.850 


0.595 


-0.429* 


5 


0.857 


0.644 


-0.331* 


Gam32 


3 


0.524 


0.629 


0.167* 


5 


0.571 


0.468 


-0.221* 


Gam33 


5 


0.500 


0.650 


0.231* 


6 


0.619 


0.483 


-0.281* 


Gam36 


3 


0.429 


0.581 


0.262* 


2 


0.571 


0.481 


-0.188* 


Gam38 


4 


0.579 


0.640 


0.096* 


4 


0.239 


0.223 


-0.070* 


Gam41 


6 


0.619 


0.671 


0.078* 


4 


0.572 


0.670 


0.147* 


Mean 


4.58 


0.664 


0.637 


-0.043 


4.58 


0.630 


0.577 


-0.091 



Note: A = number of alleles per locus; F = fixation index; H e = expected heterozygosity; H Q = observed heterozygosity; N = sample size for each 
population. 

* Significant departures from Hardy- Weinberg equilibrium at P < 0.05. 

a All values are based on two populations located in Selvfria, Mato Grosso do Sul (22°22'02"S, 51°25'08"W), and Mogi-Guacu, Sao Paulo (22°17'25"S, 
47°10'55"W), Brazil. 



GoTaq Colorless Master Mix, 10 uM of each primer (F and R), 2.7 uL of nu- 
clease-free water, and 7.5 ng of template DNA. The PCR program for all prim- 
ers consisted of 5 min of initial denaturation at 95 °C followed by 30 cycles of 
denaturation at 95°C for 45 s, a primer- specific annealing temperature (Table 1) 
for 1 min, extension at 72°C for 1 min, and a final extension at 72°C for 7 min. 
The amplification products were checked with 3% agarose gels, separated in 
6% denaturing polyacrylamide gels, and visualized by silver- staining. We used 
a 10-bp DNA ladder (Invitrogen) as a size standard for allele scoring. The num- 
ber of alleles per locus (A), the observed (H 0 ) and expected heterozygosities 
(H e ), fixation index (F), and genotypic disequilibrium were estimated for each 
population using the FSTAT program (Goudet, 2002). To test whether the F 
values and the genotypic disequilibrium were significantly different from zero, 
we used 1000 Monte Carlo permutations (alleles among individuals) and a 
Bonferroni correction (95%, a = 0.05). 

For the polymorphism evaluation, we sampled a total of 40 adult G. ameri- 
cana trees from two populations 580 km apart — 21 individuals from a small 
forest fragment in the Ecological Station of Mogi-Guacu (Sao Paulo State, 
Brazil) and 19 individuals from a natural population in Selvfria (Mato Grosso 
do Sul State, Brazil). Of the 32 designed primer pairs, 17 were polymorphic and 
15 were monomorphic in the analyzed populations. In the Selvfria population, 
among the polymorphic microsatellites, the number of alleles ranged from three 
(GamOl and Gam31) to eight (Gam06 and Gam24), with an average of 4.58 
alleles per locus (Table 2). In the Mogi-Guacu population, the number of alleles 
ranged from two (Gam36) to seven (Gamll), with an average of 4.58 alleles 
per locus (Table 2). For the Selvfria population, H 0 and H e ranged from 0.43 to 
0.89 and from 0.56 to 0.75, respectively, and in Mogi Guacu these ranged from 
0.24 to 1.00 and from 0.22 to 0.78, respectively. The Hardy-Weinberg equilib- 
rium was tested in both populations. All loci were in disequilibrium (except the 
locus Gam06 in the Selvfria population and the average populations), most 
likely due to genetic drift caused by forest fragmentation. 



CONCLUSIONS 

The molecular markers described here are the first microsat- 
ellite loci isolated for G. americana. The microsatellite loci 



showed median levels of polymorphism in the two populations 
analyzed. Based on this, our data suggest that the developed 
microsatellite markers may constitute new tools for population 
genetic studies, such as genetic diversity, spatial genetic struc- 
ture, mating system, and gene flow of G. americana. These 
studies will produce valuable information for managing frag- 
mented populations, including information for breeding, con- 
servation, and reforestation plans. 
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